Web crawling

automated scanning (browsing) of the Web
It serves as a means of gathering and supplying the most current data; it is used, for example, to copy all visited web pages so that they can be searched faster later. It also makes it possible to automate tasks such as collecting specific kinds of information, for example e-mail addresses. The Internet currently holds enormous volumes of data (web data), and even the most powerful search engines index only a relatively small part of it; it is therefore especially important that the pages a search agent (crawler) selects be the most relevant ones rather than merely a random sample of web pages.
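The point about choosing relevant pages rather than a random sample can be illustrated with a small best-first crawl. This is only a sketch under stated assumptions, not how any particular search engine works: the "web" is a hypothetical in-memory dictionary (`TOY_WEB`), relevance is a crude keyword count, and the names `keyword_score` and `crawl` are invented for this illustration.

```python
import heapq
from typing import Dict, List, Tuple

# Hypothetical in-memory "web": URL -> (page text, outgoing links).
Web = Dict[str, Tuple[str, List[str]]]

TOY_WEB: Web = {
    "a": ("crawler index", ["b", "c"]),
    "b": ("cooking recipes", ["d"]),
    "c": ("crawler crawler search", ["e"]),
    "d": ("more recipes", []),
    "e": ("search engine crawler", []),
}


def keyword_score(text: str, keywords: List[str]) -> int:
    """Count keyword occurrences as a crude relevance estimate (an assumption)."""
    lowered = text.lower()
    return sum(lowered.count(k.lower()) for k in keywords)


def crawl(web: Web, seed: str, keywords: List[str], max_pages: int) -> Dict[str, str]:
    """Best-first crawl: always visit the most promising known URL next."""
    stored: Dict[str, str] = {}  # copies of the visited pages
    seen = {seed}
    # heapq is a min-heap, so priorities are negated scores.
    frontier: List[Tuple[int, str]] = [(0, seed)]
    while frontier and len(stored) < max_pages:
        _, url = heapq.heappop(frontier)
        if url not in web:
            continue  # dead link: nothing to copy
        text, links = web[url]
        stored[url] = text
        # A link inherits the score of the page that cites it.
        priority = -keyword_score(text, keywords)
        for link in links:
            if link not in seen:
                seen.add(link)
                heapq.heappush(frontier, (priority, link))
    return stored
```

Calling `crawl(TOY_WEB, "a", ["crawler"], max_pages=4)` spends its page budget on "e", which is linked from a keyword-rich page, and never visits "d", which is linked from an off-topic one. A real crawler would fetch pages over HTTP and use a far better relevance estimate.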
Syn:

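English-Russian Explanatory Dictionary of Terms and Abbreviations in Computing, the Internet, and Programming (Англо-русский толковый словарь терминов и сокращений по ВТ, Интернету и программированию). 1998-2007.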


See what "Web crawling" means in other dictionaries:

  • Distributed web crawling — is a distributed computing technique whereby Internet search engines employ many computers to index the Internet via web crawling. Such systems may allow for users to voluntarily offer their own computing and bandwidth resources towards crawling… …   Wikipedia

  • Web archiving — is the process of collecting portions of the World Wide Web and ensuring the collection is preserved in an archive, such as an archive site, for future researchers, historians, and the public. Due to the massive size of the Web, web archivists… …   Wikipedia

  • Crawling — is a form of animal locomotion generally involving slow movement along the ground, such as that seen in snakes, snails and earthworms. Various mechanisms are involved, for example earthworms move by peristalsis, while snakes undulate their body… …   Wikipedia

  • Web harvesting — is an implementation of a Web crawler that uses human expertise or machine guidance to direct the crawler to URLs which compose a specialized collection or set of knowledge. Web harvesting can be thought of as focused or directed Web… …   Wikipedia

  • Web template (disambiguation) — Web template may refer to: Web template, web site design templates; Website Parse Template, web site structured content description for web crawling …   Wikipedia

  • Web crawler — For the search engine of the same name, see WebCrawler. For the fictional robots called Skutters, see Red Dwarf characters#The Skutters. Not to be confused with offline reader. A Web crawler is a computer program that browses the World Wide Web… …   Wikipedia

  • Web search engine — Search engine redirects here. For other uses, see Search engine (disambiguation). The three most widely used web search engines and their approximate share as of late 2010.[1] A web search engine is designed to search for information on the Wo …   Wikipedia

  • Web search query — A web search query is a query that a user enters into web search engine to satisfy his or her information needs. Web search queries are distinctive in that they are unstructured and often ambiguous; they vary greatly from standard query languages …   Wikipedia

  • Web traffic — is the amount of data sent and received by visitors to a web site. It is a large portion of Internet traffic. This is determined by the number of visitors and the number of pages they visit. Sites monitor the incoming and outgoing traffic to see… …   Wikipedia

  • Crawling (disambiguation) — Crawling has several meanings: Crawling, a form of locomotion by some animals, especially insects but also, in some cases, humans; Crawling (song), a song by the nu metal/rap metal band Linkin Park; A web crawler is a software application… …   Wikipedia

  • Web scraping — (sometimes called harvesting) generically describes any of various means to extract content from a website over HTTP for the purpose of transforming that content into another format suitable for use in another context. Those who scrape websites… …   Wikipedia

